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1  Introduction 


Space-Time  Adaptive  Processing  (STAP)  is  a  technique  used  to  reject  interference  [1,  2], 
offering  the  potential  to  improve  airborne  radar  performance  in  several  adverse  cases  such  as 
low-velocity  target  detection  as  well  as  that  of  weak  targets  obscured  by  sidelobe  clutter.  It  is 
based  on  a  filter  that  adaptively  processes  the  radar  data  received  from  L  different  spatial 
channels  (sub-arrays  or  elements  of  a  phased  array)  and  N  pulses  in  the  coherent  processing 
interval. 

In  [3]  the  optimum  linear  processor  which  maximizes  the  output  signal  to  clutter/interference 
plus  noise  power  ratio  is  derived  under  the  assumption  of  disturbance  with  known  covariance 
matrix.  However,  since  the  clutter/interference  spectral  properties  are  not  known  a  priori  and 
may  also  vary  both  in  time  and  space,  a  structure  which  adaptively  estimates  the  weights  of  the 
aforementioned  filter  has  been  proposed  and  assessed  in  [4].  It  is  based  on  a  set  of  K  secondary 
data  collected  from  range  gates  spatially  close  to  the  one  under  test  and  sharing  the  same 
covariance.  Besides,  in  order  to  achieve  an  effective  clutter/interference  suppression,  a  large 
number  of  range  cells  (2 NL  for  achieving  an  average  perfonnance  within  3  dB  of  the  optimal 
value)  sharing  the  same  covariance  matrix  is  required.  This  is  an  important  limitation  since  in 
real  environments  the  number  of  data  in  which  the  clutter  is  homogeneous  (often  referred  to  as 
sample  support)  might  be  very  limited. 

In  order  to  circumvent  this  drawback  several  techniques  have  been  proposed  in  the  recent 
past.  They  largely  fall  into  four  distinct  classes: 

1 .  Minimal  sample  support  methods  which  include  reduced-rank  STAP  [2],  chapters  5,6, 
and  [9],  multistage  Wiener  filtering  techniques  [5],  joint-domain  localized  algorithms  [6], 
least  squares  space-time  filters  exploiting  the  property  of  finite  correlation  length  [2], 
chapter  [7],  principal  components  techniques  [7],  pre/post-Doppler  STAP  [1]  based  also 
on  a  partial  a-priori  knowledge  of  the  disturbance  scenarios,  methods  exploiting 
structural  information1  about  the  disturbance  covariance  matrix  [8,  9,  10,1 1,12],  and, 
finally,  covariance  tapers  [7]. 

2.  Knowledge-aided  STAP  employing  intelligent  training,  namely  suitable  techniques  for  a 
careful  selection  of  the  secondary  data  [13],  or  knowledge-based  criteria  [14,  15],  which 
resort  to  a-priori  information  concerning  the  surrounding  environment. 

3.  Higher-order  methods  which  include  non-linear  filters  [16,  17,  18]  to  suppress  the  clutter 
ridge  and  approximation  by  first-order  Taylor  series  of  the  non-stationarity. 

4.  Deterministic  (single  shot)  techniques  [19]. 

In  this  paper  we  propose  a  knowledge-based  approach  for  STAP  which  exploits  the  special 
structure  of  the  clutter  Power  Spectral  Density  (PSD)  [20].  Indeed  it  is  well  known  that  in  the 
absence  of  Intrinsic  Clutter  Motion  (ICM)  and  neglecting  velocity  misalignment  the  clutter  ridge 
(namely  the  locus  in  the  Doppler-angle  domain  where  the  clutter  PSD  is  sensibly  different  from 
zero)  is  composed  of  a  finite  width  line  (Doppler  unambiguous  clutter)  or  of  multiple  finite  width 
lines  (Doppler  ambiguous  clutter).  We  use  at  the  design  stage  this  a-priori  information  which 


1  These  methods  are  also  known  as  parametric  or  model-based  techniques. 
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for  the  case  of  integer  clutter  ridge  slope  fi  as  shown  in  Section  3,  is  equivalent  to  assuming  the 
knowledge  of  the  clutter  covariance  matrix  subspace. 

In  order  to  synthesize  the  receivers  we  resort  to  the  Generalized  Likelihood  Ratio  Test 
(GLRT)  and  the  two-step  GLRT  (2S-GLRT)  criteria,  and  derive  two  new  decision  rules  referred 
to  as  the  Knowledge-Based  GLRT  (KB-GLRT)  and  the  Knowledge-Based  2S-  GLRT  (KB-2S- 
GLRT)  respectively.  Actually,  the  quoted  derivations  refer  to  the  case  of  integer  /?.  Hence,  in 
order  to  deal  with  the  most  general  framework  of  non-integer  /?,  we  develop  two  approximate 
versions  of  the  aforementioned  receivers  and  discuss  the  possible  exploitation  of  further  a-priori 
information  concerning  the  clutter  covariance  subspace. 

The  paper  is  organized  as  follows.  In  Section  2  we  formulate  the  problem  and  in  Section  3 
we  provide  some  insights  about  the  clutter  covariance  matrix.  In  Sections  4  and  5  we  devise  the 
KB-GLRT  and  the  KB-2S-GLRT  respectively.  The  performance  of  the  new  receivers  is 
presented  in  Section  6  also  in  comparison  with  several  previously  devised  STAP  detectors. 
Section  7  is  devoted  to  the  generalization  of  the  novel  decision  rules  to  the  case  of  a  non-integer 
clutter  ridge  slope.  Finally,  conclusions  are  drawn  in  Section  8. 

2  Problem  Foundation 

Data  are  collected  by  a  narrowband  antenna  array  with  L  spatial  channels  which,  for 
simplicity,  we  assume  colinear,  omnidirectional,  and  equally  spaced  with  spacing  d.  Each 
channel  receives  N  echoes  corresponding  to  the  returns  of  a  coherent  pulse  train  composed  of  N 
pulses  with  a  pulse  repetition  time  Tp.  Denote  by  xh  /  =1 ;  :  :  :  ;N,  the  /.-dimensional  column 
vector  of  the  snapshots  along  the  spatial  channels  and  let 

r  =  [  x{\  xf, (NL  -  dimensional)  (1) 

with  (,)T  denoting  the  transpose  operator. 

Under  the  hypothesis  Ho,  namely  target  absence,  the  vector  r  contains  disturbance  only,  i.e. 

r=  d,  (2) 

where  d  denotes  the  overall  disturbance  accounting  for  both  clutter  and  thermal  noise. 

Under  H\,  r  contains  also  a  target  component,  i.e. 

r  =  ap  +  d ,  (3) 

with  a  denoting  the  complex  amplitude  accounting  for  both  the  target  as  well  as  the  channel 
propagation  effects  and p  is  the  target  space-time  steering  vector.  More  precisely,  the  vector  p  of 
the  returns  from  a  target  with  normalized  Doppler  frequency  cotg  and  spatial  frequency  0tg  can  be 
written  as 


P  =  a(uitg)  ®  b(6tg)  (NL  —  dimensional)  (4) 

where  ®  denotes  the  Kronecker  product, 

b(0ig)  =  [1,  exp(f27r#ts), . . . ,  e-xp(j2ir8tg(L  —  T))]r  (L  —  dimensional) ,  (5) 


and 
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a{oJt!/)  =  [1,  exp(j'27rtJt3), . . . .  cxp(f2irajtg(N  —  1))]T  (N  -  dimensional) ,  (6) 

with  j  =  \2—  1, 

In  the  sequel  we  assume  that  the  disturbance  vector  d  is  a  zero-mean  complex  circular 
Gaussian  vector  with  covariance  matrix 

R  =  Rq  +  <j2I  (7) 

where  Rc  denotes  the  clutter  covariance  matrix,  cf  the  thennal  noise  level,  and  /  is  the  identity 
matrix. 


3  On  the  Structure  of  the  Clutter  Covariance  Matrix 

According  to  [1],  neglecting  the  ICM,  the  clutter  covariance  matrix  can  be  written  as 

Nc 

=  u(6i, uii )  Wj)  (8) 

e— 1 

where  (.)f  is  the  conjugate  transpose  operator,  Nc  denotes  the  number  of  clutter  patches  involved 
in  the  scattering  mechanism,  cp,  is  a  positive  real  number  accounting  for  the  power  reflected  by 
the  7-th  clutter  patch,  9t  and  co,  denote  respectively  the  spatial  frequency  and  the  nonnalized 
Doppler  frequency  of  the  7-th  clutter  patch,  and 

u(0i,uji)  =  a(ui)  <g>  b(9i) .  (9) 

Moreover,  since  in  the  absence  of  velocity  misalignment 

2  v  t 

Wi=  d1 9i  =  m  =  (10) 

with  oa  the  platform  velocity,  we  can  rewrite  R,  as 

Rc  =  £*i  v(Qi)  ut(^i)  (H) 

i=i 

where  v(9i)  =  v( 6„  fiO,).  In  the  sequel  we  suppose  that  fd  is  an  integer,  so  that  the  Brennan  rule 
holds  with  equality  [1],  i.e. 

rank(J2c)  =  min  (r,  Nc )  ,  (12) 

with  r  =  (N-l)  P+L,  and  develop  an  alternative  expression  for  Rc.  To  this  end  we  first  prove  the 
following  proposition. 
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Proposition  1:  Denote  by  #i, . . . ,  0r,  r-distinct  real  values  complying  with  the  con¬ 
dition  61  G  [0, 1[,  l  =  1, . . . ,  r.  Then  Vi  €  {1, ... ,  Nc} 

v(9i)  =  °i,I  v&)  €  c  •  (13) 

i=i 

Proof:  Consider  the  NL  X  (r  +  1)  matrix 

V  =  [  V&), . . . ,  v(6r),  w(0<)]  (14) 

with  i  e  { 1 , . . .  ,NC }  and  recast  it  as 


1 

1 

1 

Z1 

t .  ZT 

Zi 

~L- 1 
zl 

1?L-1 

0 

Z1 

4 

—  1 
z1 

zf+L-! 

(iV-l)/3 

Z1 

SN-1)0 

(N-1)0+L-1 

Z1 

y[N-1)0+L-l 

1  * 

(N-l)p+L~l 

where  z/=  exp (j2n  Oj),  1=1,  ...  ,r,  and  Zi  =  exp (j2jr6l). 

Moreover,  since  the  rank  of  V  is  invariant  under  pennutation  of  its  rows,  rearrange  them 
so  that  the  distinct  elements  of  each  column  occupy  the  top  rows  and  the  replicated  entries 
occupy  the  bottom  rows.  This  procedure  leads  to  the  new  matrix  whose  rank,  due  to  the 
replicated  rows,  is  at  most  r  =  (N  -  1)J3  +  L. 


l 

1 

1 

2 1 

...  Zr 

Zi 

■?2 

Z1 

*  *  *  *r 

z\ 

~(N-l)0+L-l 

Z1 

y(N-l)0+L-l 

... 

(N-1)0+L~l 

i 

* 

repeated  rows 

(16) 


Finally,  since  its  first  r  columns  are  linearly  independent,  due  to  the  assumption  on  the  9/s,  then 
rank(V)  =  rank(Vi)  =  r  and  the  (r  +  l)-th  column  can  be  expressed  as  a  linear  combination  of  the 
first  r. 
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Exploiting  the  result  of  the  above  proposition  we  can  rewrite  Rc  as  follows 


Re  =  SV’i  (ZXl  ^O9*)) 

i=l  V/=l  /  \fc=l  / 


or  equivalently  as, 


r  r  Nc 


/=!  t=l 


Rc  =  v(0i)  vf(0t)  =  V(0l)  VH9t) 

1=1  t=l  i=l 

where  (•)*  is  the  complex  conjugate  operator  and 

Nc 

SM  =  ■ 

i= 1 

Finally,  in  matrix  form,  Rc  can  be  compactly  written  as 
Rc  =  ASAf 


(17) 


(18) 


(19) 


(20) 


Where  A=  v[o\  ,...,v(o)r  ,  and  S  is  a  non-negative  definite  r  x  r  hermitian  matrix.  We  stress 
the  importance  and  originality  of  the  representation  (20)  which,  due  to  the  result  of  Proposition 
1 ,  is  independent  of  the  particular  set  (d) ...,(#) r  chosen.  In  other  words,  when  [1  is  an  integer, 
the  clutter  subspace  is  known  and  coincides  with  the  subspace  spanned  by  the  columns  of  A 
irrespective  of  the  particular  choice  concerning  {d)\,  ...,(#  )r. 


4.  Derivation  of  the  KB-GLRT 


This  section  is  devoted  to  the  derivation  of  the  GLRT  for  the  hypothesis  testing  problem  of 
Section  2.  To  this  end,  as  in  [20],  we  assume  that  a  set  of  secondary  data  #•/,  ...,  rK  (K  >  NL), 
namely  vectors  from  range  cells  surrounding  the  one  being  tested,  free  of  useful  signal 
components,  and  with  the  same  spectral  properties  of  r,  is  available.2  Then  we  consider  the 
following  GLRT  decision  rule 

maxe,/(r,  fi,-,  r/dex,^)  >'  Q 
max0()/(r,  n,...,  r,f|0o,17o) 

where  0o  is  the  set  of  unknown  parameters  under  the  hypothesis  Ho,  0i  =  0o  u  {a}  is  the  set  of 
unknown  parameters  under  the  hypothesis  H\,  f(r,  ru  ...,  rKl  0i,  1 1  \)  and  f(  r,  rh  ...,  rK  I  ©o,  H0) 
denote  the  probability  density  functions  (pdfs)  of  the  data  under  the  H\  and  Ho  hypotheses 


re0  =  {S,«72},  S>  0 

\  ©1  =  (a,  3,  a2}  ,  S  >  0 


(22) 


the  GLRT  can  be  rewritten  as 

ma3Ca,ff,^/(r>  ^|a,<g,cr2,gi)  ^ 

_  maxjSo.2/(  r,  rj,...,  rK|S,  cr2,H0) 

2The  secondary  data  are  modeled  as  independent  and  identically  distributed  zero-mean  complex  Gaussian  random 
vectors. 
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respectively.  Since 

Further  developments  require  specifying  the  data  pd’s  under  both  hypotheses.  To  this  end  we 


/{  r,  ru  . . . ,  rK\a ,  S,  a2,  HJ  = 


trjCAStAt  +  cFrr'Tda)] 1 K+1 


and 


nN  det(J4SAt  +  a2 1) 

/(  r,  rjr|S,  a2,  H0)  = 


(24) 


1 _  -tr^ASA*  +  a2I)~1T0 


-xN  dot  (ASA)  +  a2 1) 


'I}™ 


(25) 


highlight  that 

where  det(-)  and  tr(-)  denote  respectively  the  determinant  and  the  trace  of  a  square  matrix,  while 


Ti{a) 


1 


K  +  1 


K 


{  r  -  ap)(  r  -  ap)f  +  ^  rk  r\ 


k= 1 


To  =  ^  r  rf  +  J2  rk 


T\  and  T0  are  defined  as 

The  maximization  of  the  denominator  of  (23)  can  be  performed  exploiting  the  result  of  [21]. 


•Ro  ,M£  =  Proc(To)  (26) 

Precisely,  the  Maximum  Likelihood  (ML)  estimator  of  R  under  the  Ho  hypothesis  is  given  by 
where  the  procedure  Proc  (•)  is  defined  in  Appendix  A. 


It  is  now  necessary  to  perform  the  maximization  of  the  numerator.  To  this  end  we  observe 
that  maximizing  (24)  over  a,  for  known  R ,  gives  the  following  equation 


plR  1  r 
ptJl-1p 

Moreover,  maximizing  (24)  over  R,  for  known  a  we  get 

R=  Proc  (Tj(a)) 


(27) 


(28) 


It  follows  that  (27)  and  (28)  are  two  sets  of  conditions  which  implicitly  define  the  joint  ML 
estimates  of  a  and  R.  Unfortunately,  it  seems  that  the  values  of  a  and  R  capable  of  jointly 
satisfying  the  two  conditions  cannot  be  expressed  in  an  explicit  form. 


However,  it  is  possible  to  define  a  recursive  procedure,  based  on  cyclic  maximizations  of 
(24),  capable  of  increasing  an  initial  value  of  the  likelihood  function.  Precisely  let  R*'*and  aU) 
be  the  estimates  of  R  and  a  at  the  /-th  step  of  the  recursion  and  evaluate. 
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(29) 


a(t+1)  =  argma xj(  r,  ru...,  JTi)  =  P  ^ J - - 

pWV1? 

and 

=  argmax jj/ (  r,  rx, . . . ,  rK |a(m),  R,  iTi)  =  Proc  (tx  (30) 

where  argmaxx  (•)  denotes  the  value  of  x  which  maximizes  the  argument. 

The  recursion  is  such  that  the  sequence  of  the  values  of  the  likelihood  function,  namely  the 
sequence 

/  (  r,  rx, . . . ,  rjcla^,  H?\  H^j 

increases  as  t  increases.  Precisely,  the  following  statement  holds  true. 

Proposition  2:  The  sequence  of  the  values  of  the  likelihood  function  is  increasing,  namely: 
Not  only  the  likelihood  sequence  is  increasing,  but  it  is  also  convergent,  namely 

/(r,  r!,...,  <f(^r,  rx,...,  rJf|a(*+1),S(W), (31) 

Proof:  In  order  to  prove  equation  (3 1)  it  is  sufficient  to  observe  that 

f(r,  ru...,  rKja(t\  <  msxf^r,r1,...,rKla,S^t),If1'j 

=  /(p,  n,..., 

<  m|x/(r,  ri,...,  rKja(t+1\  R, 

=  f  (  r,  n,..., 


Not  only  is  the  likelihood  sequence  increasing,  but  it  is  also  convergent,  namely 


lim /  (  r,  ri, ... ,  rK\a{t\  R® ,  H^j  <  +oo.  (32) 

In  fact,  given  r,  i*i,  . . .,  rK,  the  likelihood  function  is  upper-bounded. 

Starting  from  the  above  considerations  we  obtain  the  following  recursive  expression  of  the 
KB-GLRT 


f{r,  ri,...,  rjclaW.g^flx)  Q 
f{r,  rK\RatML,HQ)  ^ 


(33) 
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which  taking  the  logarithm  can  be  recast  as 


log 


det  ^-Rq, ml) 


det 


(**) 


—  tr 


(i^)  1T1(sW)-(r0,ml)  'To 


Hi 

> 

< 

Ho 


G, 


(34) 


where  G  denotes  a  suitable  modification  of  the  original  threshold. 

For  its  implementation  only  an  initial  estimate  R  is  required.  To  this  end  it  is  convenient  to 
exploit  the  ML  estimate  of  R  evaluated  from  the  secondary  data,  i.e. 


£<0> 


=  i?  =  Proc 


K 

E 

*:=! 


(35) 


A  summary  of  the  complete  algorithm  for  the  evaluation  of  the  KB-GLRT  (34)  is  given  below. 

1.  Evaluate  (35). 

2.  For  t=l,  ...,Nit compute  R^. 

3.  Perform  the  test  (34). 

Finally,  considerations  concerning  the  mathematical  tractability  of  the  problem  for  non-integer 
values  of  (3  are  given  in  Section  7. 


5.  Derivation  of  the  KB-2S-GLRT 


In  this  section  we  devise  the  2S-GLRT  detector  [22]  exploiting  the  a-priori  infonnation 
about  the  coupling  between  the  normalized  Doppler  and  the  spatial  frequencies.  Otherwise 
stated,  we  first  derive  the  GLRT  based  upon  the  primary  data  vector  r  assuming  that  the 
covariance  matrix  R  is  known.  Then  a  fully  adaptive  detector  is  obtained  substituting  the 
unknown  matrix  by  its  ML  estimate  based  upon  secondary  data  only. 


Step  1.  For  known  R  the  GLRT  based  upon  the  primary  data  is  the  following  decision  rule 


IjjIjR  1  rj2  '>1 
Pm~'P  H0 


(36) 


Step  2.  We  can  make  detector  (36)  fully  adaptive  by  plugging  the  ML  estimate  of  R,  based 
upon  the  secondary  data,  in  place  of  its  exact  value.  By  doing  so  we  come  up  with  the  following 
adaptive  decision  rule  (KB-2S-GLRT) 


P+Ra  V  Ho 


(37) 


In  Section  7  some  considerations  for  an  approximate  implementation  of  the  receiver  for  the  case 
of  a  non-integer  clutter  ridge  slope  are  provided. 
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6.  Performance  Assessment 


This  section  is  devoted  to  the  perfonnance  assessment  of  the  KB-GLRT  and  of  the  KB-2S- 
GLRT  also  in  comparison  with 

•  the  optimum  receiver  (for  targets  with  uniformly  distributed  phase)  which  assumes 
perfect  knowledge  of  the  clutter  covariance  matrix,  i.e., 

Hi 

| p^R  1  r\2  <  G  Optimum-RX.  (38) 

Ha 

•  classical  Kelly’s  receiver  and  the  Adaptive  Matched  Filter  (AMF)  which,  in  spite  of  the 
new  detectors,  do  not  exploit  the  special  structure  of  the  clutter  PSD  (the  coupling 
between  Doppler  and  spatial  frequency),  i.e. 


and 


where 


b^sr1  r|2 

Hi 

>  G  Kelly-RX , 

Ho 

(39) 

(l  +  i  nsj1  r)  p^s~lp 

\p'S?  r|>  § 
ptsrV  la 

AMF-RX, 

(40) 

s»  =  i  J2  rkr{, 

*=i 


is  the  secondary  data  sample  covariance  matrix. 

To  this  end  we  resort  to  Monte  Carlo  simulation  based  on  i°°  independent  trials.  Moreover, 

Pfa 

2 

in  order  to  limit  the  computational  burden  we  set  Pfa  =  1 0‘-.  We  also  assume  L  =  4,  N=  6,  0tg= 

0,  <x>tg  =  0.01,  and  Nc  =  1000  independent  and  identically  distributed  clutter  patches.  The 
Clutter-to-Noise  Ratio  (CNR)  is  defined  as 


CNR (41) 

<7^ 

and  is  chosen  equal  to  30  dB.  Finally,  simulation  results  have  shown  that  N„  =  1  is  sufficient  in 
order  for  the  KB-GLRT  to  achieve  convergence. 

In  Figures  1,  2,  and  3  the  probability  of  detection  (Pd)  is  plotted  versus  the  Signal  to 
Interference  plus  Noise  Power  Ratio  (SINR)  for  [1  =  I  and  several  values  of  K.  SINR  is  given 
by: 


3INR=\a\2ptR-1p, 


(42) 
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where  |-|  denotes  the  absolute  value  of  a  complex  number.  The  results  show  that  the  KB-GLRT 
outperforms  the  KB-2S-GLRT.  Both  new  detectors  exhibit  a  l\i  level  higher  than  Kelly’s 
receiver  and  the  AMF.  Moreover,  the  performance  gain  depends  on  the  size  of  the  available 
sample  support.  The  smaller  K  is  the  higher  the  gain.  Finally,  for  the  parameter  values  chosen, 
the  loss  of  the  new  receivers  with  respect  to  the  optimum  processor  is  acceptable  even  for  small 
sample  supports,  and  the  loss  decreases  as  K  increases. 

In  Figure  4  the  effect  of  Doppler  ambiguous  clutter  is  analyzed.  We  assume  (3  =  2,  K  =  3-, 
and  the  same  target  parameters  of  the  previous  three  figures.  The  curves  show  that  even  in  the 
presence  of  an  aliased  clutter  ridge  the  novel  receivers  outperform  Kelly’s  detector  and  the  AMF, 
especially  for  values  of  Pci  in  the  low-medium  range. 


7.  Analytical  Problems  Connected  with  the  Case  of  a  Non- 
Integer  (3  and  a  Possible  Approximate  Solution 

In  this  section  we  highlight  the  analytical  complications  which  were  raised  in  the  derivation 
of  the  GLRT  with  reference  to  the  case  of  a  non-integer  value  of  the  parameter  [3,  and  we 
propose  two  approximate  decision  strategies. 

To  this  end  we  observe  that  according  to  the  Brennan  rule  the  rank  of  the  clutter  covariance 
matrix  can  be  approximated  as  [1] 

rank(i£c)  =  min  (r,  Nc)  ,  (43) 


Where  r  =  round  ((A  -  1)  (3  +  L),  and  round  (•)  indicates  rounding  to  the  nearest  integer.  As  a 
consequence  the  clutter  covariance  matrix  still  admits  the  representation  in  (20),  but  Proposition 
1  does  not  hold  true.  Otherwise  stated,  the  values  0  i,  ...,9  r cannot  be  arbitrarily  chosen  and  the 
matrix  A  must  be  considered  a  function  of  the  vector  6  =  [(#)i,  ...,(#)r],  i.e.  A  =  A(9).  It 
follows  that  the  parameter  sets  under  the  Ho  and  II\  hypothesis  are 


(  00  =  {0,S,<72},  S>  o  0e[o,i[r 
{  0!  =  {a,  0,  S,  a2} ,  S>  0  0e[O,l[r 


(44) 


In  order  to  evaluate  the  GLRT  it  is  necessary  to  calculate  the  ML  estimates  of  the  parameters 
under  both  hypotheses.  Thus  we  first  focus  on  the  H\  hypothesis  and  consider  the  problem 

max  /(  r,  r1;...,  r^a,  0,  S,  a2,  Hi) .  (45) 

a,9,S ,<j* 

Maximizing  over  a,  for  fixed  R,  yields 


a  (0,  R ) 


p*R  1  r 
p^R~xp 


(46) 
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(47) 


Moreover,  maximizing  over  S  >  0  and  a2  for  fixed  a  and  ft  we  get 


R(a,  9)  =  Proc(jTi(a)) . 

I  2 

Finally,  differentiating  f(r,  r\,  ...,  rKl  a,  ft  S,  a",  //|)  with  respect  to  the  components  of  ft  for 
fixed  a,  5,  and  cr,  we  come  up  with  the  following  set  of  non-linear  equations  which  implicitly 
define  a  stationary  point: 


where 


tr  |  ^A(9)SAH9)  +  a^1  ^-SA\9)  +  A{9)S^M^ 

[(-A(0)£Mt(0)  +  u2/)"1  rj(a)  -  /] }  =  0 


i  =  1, .  •  ■ ,  r , 


dA(6 )  dv(9i)  T 
ddi  -  dSi  &i 


(48) 


(49) 


and  is  the  NL  column  vector  with  all  zero  components  but  for  the  i- th  which  is  equal  to  1 . 

The  joint  ML  estimates  of  the  parameters  must  satisfy  jointly  (46),  (47),  and  (48).  This  is  an 
apparently  mathematical  intractable  problem.  Similar  considerations  can  also  be  done  with 
reference  to  the  maximization  problem  under  the  H0  hypothesis  as  well  as  to  the  derivation  of  the 
KB-2S-GLRT  for  the  case  of  a  non-integer  beta. 

In  order  to  circumvent  this  drawback  we  notice  that  the  GLRT  strategy  (and  the  two-step 
GLRT  technique)  can  be  still  applied  in  an  approximate  fashion  if  we  assume  that  an  estimate  of 
A  is  available.  To  this  end  a  very  simple  estimate  which  could  be  adopted  is  represented  by  the 
dominant  eigenvectors  of  the  secondary  data  sample  covariance  matrix.  We  explicitly  point  out 
that  this  solution  does  not  sensibly  increase  the  computational  burden  connected  with  the 
evaluation  of  the  decision  rule  since  the  eigenvalue  decomposition  of  the  aforementioned  matrix 
must  be  evaluated  as  required  by  (35). 

The  receivers  which  exploit  the  quoted  approximation  will  be  referred  to,  in  the  sequel,  as  the 
Approximate  KB -GLRT  (AKB-GLRT)  and  the  approximate  KB -2 S -GLRT  (AKB-2S-GLRT). 
Their  detection  performance  (evaluated  by  Monte  Carlo  simulation)  is  analyzed  in  Figures  5  and 
6  also  in  comparison  with  the  optimum  receiver,  the  Kelly  receiver,  and  the  AMF  for  Pfa  =  10'2, 
L  =  4,  N=  6,  K  =  3-,  0tg  =  0,  cofe  =  0.01,  and  Nc  =  1000  independent  and  identically  distributed 
clutter  patches.  Figure  5  refers  to  p  =  0.9  whereas  Figure  6  assumes  p  =  1.1.  Finally  we  assume 
Nu  ~  1,  and  CNR  =  30  dB. 

The  curves  show  that  in  both  situations  the  AKB-GLRT  and  the  AKB-2S-GLRT  outperform 
the  Kelly  receiver  and  the  AMF.  In  particular,  for  Pci  =  0.9,  the  performance  gain  is  kept  within 
5  dB  for  both  p  =  0.9  and  p  =  1.1.  Finally,  always  for  Pd  =  0.9,  the  loss  of  the  knowledge-based 
receivers  with  respect  to  the  perfect  measurement  bound  is  about  4  db  (if  P  =  0.9)  or  5  dB  (if  P  = 
1.1). 

Notice  that  a  different  technique  to  cope  with  the  uncertainty  about  the  clutter  subspace 
might  be  the  employment  of  further  a-priori  information.  This  is  tantamount  to  assuming  the 
existence  of  a  database  containing  the  structure  of  the  covariance  subspace  for  several  different  p 
values.  Based  on  this  knowledge  the  processor  first  selects  the  actual  environment  (and  as  a 
consequence  the  specific  clutter  subspace)  and  then  evaluates  the  decision  statistics  for  testing 
the  target  presence. 


11 


8.  Conclusions 


In  this  paper  we  have  proposed  and  assessed  knowledge-based  detectors  for  STAP 
applications.  Exploiting  the  special  structure  of  the  clutter  ridge  we  have  derived  a  relevant 
property  of  the  clutter  space-time  covariance  matrix.  We  have  shown  that  for  integer  values  of 
the  slope  parameter  the  clutter  covariance  subspace  is  perfectly  known.  Thus  we  have  devised 
the  KB-GLRT  and  the  KB-2S-GLRT  which  exploit  the  aforementioned  property.  We  have  also 
addressed  the  most  general  case  of  clutter  ridge  with  non-integer  slope  and  have  shown  that  the 
problem  is  very  difficult  from  an  analytical  point  of  view  since  it  involves  the  solution  of 
multivariate  non-linear  equations.  Approximate  GLRT  detectors  have  been  thus  proposed  and 
hints  for  their  implementation  with  a  supporting  environmental  database  have  been  provided. 

The  perfonnance  analysis  has  shown  that  the  new  knowledge-based  systems  achieve  a 
performance  level  very  close  to  the  optimum  detector  which  assumes  perfect  knowledge  of  the 
clutter  covariance  matrix.  Finally,  the  new  receivers  can  also  outperform  some  previously 
proposed  adaptive  detection  schemes  such  as  the  Kelly  receiver  and  the  AMF. 
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Appendix  A:  R ml  =  Proc  ( T ) 


9  Appendix  A:  Rml  =Proc(T) 

Let 

^  ^  {x^detfASA1  +  a2 1) 


-tr 


[(ASA*  +  | 


K+ 1 


(50) 


where  T  is  a  positive  definite  matrix.  Then  the  ML  estimator  of  R—  ASA1  +  a2I,  with 
the  constraint  S  >  0,  can  be  evaluated  according  to  the  following  procedure  (Proc(T)). 

1.  Compute  the  Cholesky  factor  L  of  A1  A. 

2.  Form  the  r  x  r  matrix  X  =  L  1A1TAL  1  and  compute  its  eigendecomposition 
X  =  G$G\  where  =  diag^j., . . . ,  <f>r),  and  <j>i  >  . . .  >  <f>r. 


3.  Evaluate  7  =  tr(PJT)  where  P 1  =  I  —  A  (A*  A)  1A1. 

4.  Set  k  —  r. 

5-  Evaluate  a2  =  -j±rk  (7  +  E^+i  <Pi)  • 

6.  Verify  if3  4^+1  <  a2  <  </>£.  Yes:  go  to  Step  9. 

7.  k  =  k-  1. 


8.  Go  to  Step  5. 

9.  Set  u2ml  =  <r2- 

i  —  1, . . . ,  k 
i  =  k  4-  1, . . . ,  r 

11-  Sml  =  L-'G  AG'L-1. 

12.  Rml  =  ASmlA}  + 

We  explicitly  point  out  that  the  above  procedure  simplifies  if  the  matrix 

<51) 

In  fact,  under  this  last  assumption,  Sml  =  L  ^G  AiG^L' 1  and  Rml  =  ASmlA*  + 
o2mlI- 


10.  Define  A  —  diag(Ai, . . . ,  Ar)  where 

Av  = 


<k  ~  a2 

0 


3Use  the  definitions  <f>o  =  +oo  and  <j)T+\  —  0 
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Figure  1:  Pd  versus  SINR  for  Pfa  =  10 2,  L  =  4,  N=  6,  0,g  =  0,  ^  =  0.01,  /3=  1,  A  =  24,  Nit  =  1,  and  OV1?  =  30  dB. 
Optimum  receiver  (solid  curve),  KB-GLRT  (dotted  curve),  KB-2S-GLRT  (dashed  curve),  Kelly  receiver 
(plus-marked  curve),  and  AMF  (star-marked  curve). 
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Figure  2:  Pd  versus  SINR  for  Pfa  =  10  2,  L  =  4,  N=  6,  G,g  =  0,  =  0.01,  /3=l,K  =  48,Nit=  1,  and  CAT?  =  30  dB. 

Optimum  receiver  (solid  curve),  KB-GLRT  (dotted  curve),  KB-2S-GLRT  (dashed  curve),  Kelly  receiver 
(plus-marked  curve),  and  AMF  (star-marked  curve). 
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Figure  3:  Pd  versus  SINR  for  Pfa  =  10 2,  L  =  4,  N=  6,  G,g  =  0,  <»,„  =  0.01,  /3=1,K=  120,  iVft  =  1,  and  CNR  =  30 
dB.  Optimum  receiver  (solid  curve),  KB-GLRT  (dotted  curve),  KB-2S-GLRT  (dashed  curve),  Kelly  receiver 
(plus-marked  curve),  and  AMF  (star-marked  curve). 
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Figure  4:  Pd  versus  SINR  for  Pfa  =  10 2,  L  =  4,  N=  6,  d,g  =  0,  ^  =  0.01,  /?=  2,  K  =  30,  Nit  =  1,  and  CAT?  =  30  dB. 
Optimum  receiver  (solid  curve),  KB-GLRT  (dotted  curve),  KB-2S-GLRT  (dashed  curve),  Kelly  receiver 
(plus-marked  curve),  and  AMF  (star-marked  curve). 


19 


Figure  5:  Pd  versus  SINR  for  Pfa  =  10'2,  L  -  4,  N  =  6,  9tg  =  0,  co,g  =  0.01,  (3=  0.9,  K  =  30,  Nit  =  1,  and  CNR  =  30 
dB.  Optimum  receiver  (solid  curve),  AKB-GLRT  (dotted  curve),  AKB-2S-GLRT  (dashed  curve),  Kelly 
receiver  (plus-marked  curve),  and  AMF  (star-marked  curve). 
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Figure  6:  /^versus  SINR  for  Pfa  =  10'2,  L  =  4,  N=  6,  6tg  =  0,  co,g  =  0.01,  p=  1.1,  K=  30,  Nit  =  1,  and  CNR  =  30 
dB.  Optimum  receiver  (solid  curve),  AKB-GLRT  (dotted  curve),  AKB-2S-GLRT  (dashed  curve),  Kelly 
receiver  (plus-marked  curve),  and  AMF  (star-marked  curve). 
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